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Abstract 

The exact distribution of the square sum of Dirichlet random vari- 
ables is given by two different univariate integral representations. Al- 
ternatively, three representations by orthogonal series with Jacobi or 
Legendre polynomials are derived. As a special case the distribution 
of the square sum U% of spacings - also called Greenwood's statistic - 
is obtained. Nine quantiles of nll^ — 1 are tabulated with eight digits 
for n from 10 to 100. 
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1 Introduction and Notation 

Greenwood's statistic is denned by 



fc+i 

i=X 

where the U i: k denote the successive ordered values of a random sample 
U\, . . . , Uk from a uniform distribution on (0, 1) with U$± = and Uk+uk — 1- 
If F denotes any completely specified continuous cdf then, with a random 
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sample Xx, . . . X k and Ui = F (X i ), the statistic U% +1 can be used for a test 
of the hypothesis that the unknown cdf F of X coincides with F . For the ef- 
ficiency of this test see e.g. Pyke (1965). For the application of Greenwood's 
statistic by a test for exponentiality see e.g. Subhash and Gupta (1988). 

The computation of the distribution of U% has a long history beginning 
with Moran (1947) and (1951), Gardner (1952) followed by Hill (1979), Bur- 
rows (1979), Currie (1981), Stephens (1981) Currie and Stephens (1986), 
Does et al. (1988), Ghosh and Jammalamadaka (1998) and (2000). The ex- 
act methods of Burrows, Currie and Stephens use iterated integrations up to 
n = 20. The subsequent papers are concerned with approximations mainly 
by Edgeworth expansions and saddle-point methods. It is well known that 
the distribution of the standardized U% converges very slowly to a normal 
distribution. For the asymptotic normality of a more general class of statis- 
tics, based on spacings, see Darling (1953). A comparison of the approximate 
quantiles with the available "exact" quantiles for small sample sizes e.g. in 
Ghosh and Jammalamadaka (2000) shows rather unsatisfying discrepancies. 
However, thanks to the increasing computing power of computer algebra sys- 
tems, computations are available based on exact representations of the cdf 
of Greenwood's statistic. Thus errors are reduced solely to those of the nu- 
merical evaluation. 

In this paper more generally some exact representations are given for 
the distribution of the square sum U% of Dirichlet random variables derived 
from gamma distributed variables of order a. This statistic can be used as a 
competitor with the maximum likelihood statistic for the test of a specified 
value «o of the unknown order parameter a of a gamma distribution with 
any unknown scale parameter. For the maximum likelihood estimation of 
a see Provost (1988) and chapter 17 in Kotz et al. (1994). Greenwood's 
distribution is obtained by the special case a — 1. In section 4 the cdf of U n 
is represented by orthogonal series with Jacobi or Legendre polynomials. In 
section 5 two different univariate integral representations for the cdf of U% 
are found, derived by the Fourier and Laplace inversion formula. Theorem 
2, the main result, provides also the more general case with gamma random 
variables of different orders by a univariate integral over a product of 
parabolic cylinder functions. Theoretically, the integral representations look 
more elegant since no special coefficients have to be computed before. For 
the orthogonal series a sequence of moments must be computed, but the 
computing effort can be reduced to two linear recursion formulas. The eight 
digit quantiles of Greenwood's statistic for n from 10 to 100 in section 6 were 
computed by the orthogonal series in (14. 6 j) and again by (14.71) . In spite of 
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the very high computing accuracy, required to achieve the eight digit quantile 
values, this method works more quickly than the integral representations in 
section 5. 

Now let X±, . . . , X n be i.i.d. random variables with a gamma pdf 



9a{X) 



(r^j^i^and G a (i)= f g a (Qd£, , a > , x > . (1.1) 

Jo 



The "sample triangle" in M. n generated by Xi , . . . , X n , has the vertices 
(0, . . . , 0), (Xi, . . . , X n ), (X, . . . , X), with the sample mean X and the an- 
gle $ between the vectors (X\, . . . , X n ) and (1, . . . , 1). We use the further 
notations 



Y = ^2x i = nX , Z = R 2 = x t 



-2 

t=l i=l 



U = § = —L_ Q = R 2 -nX 2 = nS 2 . (1.2) 
Y y/ncos® 

The random variables Y^ — X^/Y, i = 1 , . . . , n — 1 have the joint Dirichlet 
density 

I 1 - Ei=i rii=i ^ • 



(r(a))' 

Thus, the distribution of the square sum U 2 = Y 2 , Y\ + . . . + Y n — 1 
is closely related to the distribution of the squared sample coefficient of vari- 
ation 

S 2 /X 2 = tan 2 ($) =nU 2 -l. (1.3) 

By Laha (1954) and Lukacs (1955) the independence of X and S/X was 
shown to be a characterization of the gamma distribution. We shall use the 
independence of U n and Y = nX. 

Apart from some special notations as e.g. in (11. ip the cdf, pdf, char, 
function (cf) and the Laplace transform (Lt) of any continuous random vari- 
able Z will be denoted by F z , fz, fz an d f* z respectively. E z (- ■ ■) means 
expectation with regard to Z. The difference operator is denoted by A, i.e. 
Adfc = a>k+i — ak for any sequence (ofe) and (c)k = Tli=o( c + Formulas 
from Abramowitz and Stegun are denoted by A.S. and their number. 



2 The distribution of R and Q 

These distributions were already given in Royen (2007a), (2007b). The coef- 
ficients, needed for a series representation of the cdf Fr of R, are also used in 
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the subsequent sections. Therefore, this representation is derived here more 
concisely. 

For a given power series f(z) = Efclo afc2;fc ' a o > 0> the coefficients a n> k of 
{f{z)) n can be computed by two steps: Setting 

log(/(z))=log(a ) + Er=i^ fe A, 
the relation 

fit) = ZZ (k + l)a k+ iz k = f(z)Y:Zoh + iz k 
= Er=o (H^jh+i-j) z k implies 

b k +i = Oq 1 ((& + l)ojfc + i - Ej=i a A+i-j) , € N , and 

a>n,o = Oq , an,fc+i = j^i X^=o a «,A+i-j , ^ e N . (2-1) 
Let be C = cos($). The moments 

lan+k := E (C an+k ) (2.2) 

depend on a and n not only by an, but we write jan+k for simplicity instead 
more precisely •y a ,n,an+k- With U~ l = y/nC it follows from the independence 
of U and X that 

F fl (r) = E u (G m (rU- 1 )) 

j? rr» Ur> ^ (ry^)" n ^ 7 aw+fc (-ry/n) k 
= E c (G an (rCV^))=^ V ^ an ■ (2.3) 

v 7 fc=0 

On the other hand we have with 
r((q + fc)/2) 
= 2 r(a)Jfc! U 

= (/i>(t)) B =(t- a/a E^,(-i)V**-* /2 ) n 

= t- an / 2 J2Zo(-l) k a a ,n, k r k/2 , (2-4) 

where the a a)U ^ are computed from the a a ^ by the above procedure leading to 
( 12. ip . With Z = R 2 we obtain from ( 12. 4 p by inversion, followed by integration 

oo 

W-^ r(uM t )fl ( ^' (2 ' 5) 

and by comparison with (I2.3P 

2r(an)fc! 

Wfc - T((an + k)/2)n(<™+ k )/i aa > n ' k ' (2 ' 6) 
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Finally, from the density 



f R (r) = E$ (v/ncos$ • g an (ry/n cos®)) 

= V^g a n(rVn)E* ((cos^e^ 1 -^) 

OO / i \ 



k=0 



k=0 



we obtain F R as a convex combination of gamma distribution functions: 

OO 

F R (r) = Y,P a ,n, k G an+k (r^), p a , n ,k = ( cm+ fe fc ~ 1 ) (-A) fc 7cm . (2.7) 

With gamma(a:n+ /^-distributed random variables Y an+k and a random index 
if with P{K = k} = p a ,n,k this can be restated more concisely as a stochastic 
representation: 

R is distributed as n~ l l 2 Y an+K . (2.8) 

The distribution of Q = nS 2 could be derived from the distribution of U 2 , 
but we have directly with 

K a (x v) = T HCk{y) ^ 
Z.r(/3 + A;/2) fc! ' 

where He k denotes the Hermite polynomials (A. S. 22. 2. 15), and the parabolic 
cylinder functions D_ a the integral representation 

F Q (x) = -j= (-) y K an/2+1 (^/2,y)(D„ a (-y/^)) n e-y , (2.9) 

which was given in Royen (2007b) with slightly different notations. Further 
representations of Fq are found in Royen (2007a). 
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3 Some Laplace transforms 

If X has a gamma density g a , then the bivariate Lt of (X, X 2 ) is given by 

E (-sX-tX*\ = 1 x a-l X (l+ S )/Vt dx 



— Y 



X) T ((a + k)/2) ( 1 + s x 



1 exp ( V Jl±£ ] (3.1) 



with the parabolic cylinder function D_ a from (A. S. 19. 3.1). If a £ N then 

^ gHl) ^ ^ (-sH- (t) •* (I)) ««> 

and in particular 

Z 2 \ „ , , /7T /-2 2 \ „ / -2 



exp( T j D_ 1 (z) = ^-«p (-J er/c 



/From (OD, (E2D, (J2IZD we obtain 



P{UY <r}= Fuiry-^Mdy 
Jo 

' ' (1- Fcin-^r-^y^e-vdy 



F R (r) 



F(an 

= [\l-F c (x))x an - 1 e- xr ^dx. 
T(an) J 

Thus, 

00 

iXcmK^F* (n" 1 ^) = r(«n) S - an ^p Q , n , fc G an+fc ( S ) (3.3) 

fc=0 

is the Lt of x an ' 1 (l - F c (x)), < a; < 1. (i^fx) = i/ x < n" 1 / 2 ). 

Also the Lt of C = cos(<&) can be represented by means of the probabilities 
Pa,n,k from (12.71) . If Z 1 denotes a r.v. with density e~ z , independent of U, 
and A an _i a r.v. with a beta(l, an — l)-distribution, independent of R, then 



E ( x L-i) {an-l)B(k + l,an-l) T(an) 



E{Z\) k\ T(an + k) (an), 
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Comparing this with E{R k ) = E{UY) k = E{U k )E(Y k ) = E(U k ){an) k it 
follows that U Zi and RX an _i have the same moments which determine the 
distribution completely. Thus 

UZ\ is distributed as RX an -i (3.4) 

and consequently the Lt of C = cos($) is given by 

= E (e~ sC ) = E (e~ s ^ u A = P{UZ X > sn' 1 / 2 } 
= P{RX an ^ > sn- 1 ' 2 } = P{X an _ x Y an + K > s} 
= J2kLaP<*,n,k / s °°(l - sy-^^gan+k^dy 

= e~ s EZo ((-A) fe 7 Q n) h. f °°(y + s) k 9 an (y)dy 
= e-°ZZo ((-A) fc 7«n) £? „( /'/ >'/./! 

^From the binomial series we obtain the mgf of \og(y/nU) : 

E(V^Uy = £(cos($))- s = E (- ( cos ( $ ))"" ^ 

v ; v v ;y \(1 - (l-cos($))) an +V 

_ v^°° fY_A^ fc 'v \Kt£k - V°° n , ( Qn + S )fc fq 
— Z^fe=oU 7anJ fe! — Z^k=0Pa,n,k ( Qn ) fc , ^3.UJ 

following also from ([23]) and fi og Y an+K = /io g y/io g (Vit/)- 

In particular for a = 1 the "missing moments" 7 m , m = 1, . . . n — 1, are 
given by 

7m = 7n + Lfc=lPl,n,fc ( n)fc °r 7m- 1 = Lfc= (~ A ) 7m , m = 71, 71-1, . . . 2 . 

(3.7) 

The inversion formula, derived from the Bernstein polynomials, (see e.g. 
Feller (1971) would already provide 

F c (x) = hm £, <nx © (-A)"~* 7fc , (3.8) 

n— >oo — 

but for a satisfying accuracy a very large n would be required. Therefore, 
some series expansions with orthogonal polynomials will be given in the sub- 
sequent section. 
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4 Some representations of the distribution of 
U by orthogonal polynomials 

The square integrability of the densities fc and fu is equivalent and gua- 
rantees the uniform convergence of the following orthogonal series. The con- 
dition ( 14. 4p of the subsequent lemma is sufficient for the square integrability 
of it/. 

Lemma 1. Let Y±, . . .Y n be random variables with Y^=i ^ = ^ an< ^ a 3°^ 
Dirichlet distribution with the parameters ai,...a n . Then the cdf Fu of 
U = (Y2i=i ^j 2 ) 1//2 satisfies the following relations: 

Fu K /2 + e) * * { °] S ^T^ ni3n ~ m ~ a ■ £(n ~ 1)/2 ' ^ 
rii=i r («i) T ((n + l)/2) 

n C 2 ~\ 

•£p\Y i >l-E--^- T \<l-F u (l-E)< 



1=1 



<^ P { Y ^> 1 2 + l( 1 -^ + ^ 2 ) 1/2 } > ( 4 -2) 



< e < 1 - 2~ 1/2 , where the Y t are beta(aj, a — ai)-distributed. 

1 - F v (l - e) ~ * {Aa * = _ Qmax} ■ £ "J amaX , e^O. (4.3) 

The density /[/ is square integrable if n > 2 and a — a max > 1/2 . (4.4) 

Remark: For identical on = a the a in the lemma has to be replaced by an 
and we obtain the conditions 

fu is square integrable if n > max ( 2, 1 + — 



2a 



fu{n- 1/2 ) = Ml) = if > max ^3, 1 + 



(4.5) 



Proof. With Xi = (1 + £j)x = (1 + £j)yn 1 we have tan 2 (<£>) = - J^iLi £ ? an d 
consequently with t — > : 

P{tan($) < = (nr=ir(«,))- 1 / E^xe-arr 1 ^ * 

Jtan($)<i 

/•oo 

- (rnUr^r 1 / e-^yn-'r-^yn-^r-'n-^dy ■ t n ~ l = 

Jo 
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where b n -\ denotes the volume l ^ 2 {V ((n + l)/2)) 1 of the (n — l)-unit 
ball. Thus, (14.1 p follows from 

P{U < n~ 1/2 + e} = P (tan 2 ($) < 2e^l + ne 2 } ~ P |tan($) < n 1/4 V2e} 
Now let be Y n = maxYj and < 5 < 1/2. Then 

r2 \ 1/2 



((1 - 5) 2 + -^j <[/<((!- 5) 2 + 5 2 ) 1/2 



Conversely, from Y n = max Yj and U = 1 — e , e < 1 — 2 x / 2 , it follows that 
K, = 1 — 6 with 



1 „ , , n — 1 n — l/ 2n 



\ 1/2 

£ + — ^ — -£ 2 <5i(V):=- — --- — - ( 1 - -^-e(l - e/2) ] < 
2(n-l) v ; n n \ n- 1 v 7 7 

<6<6 2 (e):=---(l-4e(l-e/2)j . 
Therefore, 

maxFj > 1 - 61(e) =>U > 1 - e =>- maxKj > 1 — 6 2 (e) , 

which implies (14. 2p . The asymptotic relation ( 14. 3 p is an immediate conse- 
quence of (14. 2p . 

For small values a — a max the density fu can increase near its end-point 
u = 1. Then already 

l--Fb(l-e) = O (e a - a ™ ) , £ -> 0, entails /tr(l-e) = O ( £ Q - Qm - - 1 ) , 

which implies (14. 4p . □ 

Now with the Jacobi polynomials Gk{<yn, an; x) from (A. S. 22. 3. 3), be- 
longing to the weight function w(x) = x an_1 ,0 < x < 1, and the shifted 
Legendre polynomials P fc *(x) = P fc (2x - 1) = ( 2 *)G fc (l, (A.S.22.2.11), 
(A.S.22.5.2), we obtain with flZSJ and 



/ (1 - F c (z)) ar " 4 *' -1 ^ 
Jo 



an + j 

the following two representations of the cdf Fjj(u) = 1 — Fc (n _1 / 2 n _1 ): 
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Theorem 1. Let U be defined as in (11.31) . Then 

F v {u) = (4.6) 



oo 

D 



k=0 x ' \j=0 



-man + *) ( Qn + 2fc ) ( 53(-dj •*■?. ) g.<<», 



which is uniformly and absolutely convergent on compact intervals [a; b] C 
(1/y/n] 1) under the first condition in (14.5)) . Besides, 



F v (u) = (n^u)"" 1 ■ (4.7) 

f)(-D'(2* + i) fo-iy(*zi^^Wvv') , 

fc=o \i=Q ,y ' ,y ' J J 

which is uniformly and absolutely convergent at least for n>max(2, 1+ j-, j-) . 

Proof. There remains only an explanation for the additional condition 
n > 3/ (2a) in the 2 nd formula. The 2 nd series is directly obtained from 
the corresponding series for the function x an ^ 1 (l — Fc(x)), < x < 1, by 
the substitution x = n' 1 / 2 ^ 1 . The square integrability of the density fc{%) 
on [n~ 1//2 ; 1] is equivalent to the square integrability of fu- The additional 
condition for n is sufficient for the square integrability of x an ~ 2 . Hence, the 
derivative of the above function is square integrable under the given condi- 
tion. □ 

In particular for a — 1, it follows with the moments jj from ( 13 .7p that 



However, it should be noted that the moments = l,...,n — 1, are 
computed by infinite series, whereas the coefficients in the orthogonal series 
i)]) , ( 14.7)) are given by finite calculations only. This is important since a 




high number of digits for the moments 7 an +j is required to obtain sufficiently 
accurate values of Fu. 

From fll.2p we can also obtain the moments 
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where Yu(k) means summation over all decompositions k± + . . . + k n = k, 
fci, . . . , k n G N • Since only partial sums are used in the computing procedure 
( 12.11) . these moments can be computed by the same method, starting with 
the divergent series 

k=0 

The square integrability of the densities fu and fa is equivalent. Therefore, 
we get a further convergent orthogonal series 

1-F,n(x)= (4.10) 



± { -im + 1) (ti- ^-^^ V* 

k=0 \j=0 J ' J ' J / 



x) , n 1 < x < 1 



but the extreme magnitude of the sums in (14.91) is unfavourable. 

A generalization of the orthogonal series in this section to U computed 
from n completely independent gamma random variables with not identical 
order parameters is feasible. However, the required moments would have to 
be computed from a product of n generating power series instead from just 
an n th power. 



5 Integral representations of the cumulative 
distribution function of U 2 

Under the 2 nd condition in (14. 5 p the density of U% — n^ 1 vanishes at its 
end-points and is of bounded variation. Then the cdf is representable by an 
integrated Fourier-sine series , i.e. 

r 9 1^2 / j . ,\ 1 - cositux) kn 
PiUl-n- 1 < x} = t k = y^zt, (5.1) 

k=l 

which is absolutely und uniformly convergent with terms at least of order 
o(k~ 2 ). 

In a similar way a series for P {log(y/nU) < x} can be derived from (13.61) . 
replacing s by it k = ikn/ log(i/n). However, an accurate computation of the 
coefficients is difficult for large k. Also a direct use of the Fourier inversion 
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formula is not recommended because of the comparatively slow decrease of 
the cf of \og(y/nU). 

Before we return to (15. ip a univariate integral representation of P{U 2 < x} 
is given by means of parabolic cylinder functions _D_ Q (A. S. 19. 3.1). 

Theorem 2. Let X\, . . . ,X n be completely independent random variables 
with gamma densities (T(aj))~ 1 e~ x x aj ~ l , a = a\ + . . . + a n and U 2 = 
(Yh=i x i)~ 2 Ya=i x f- Then the cd f °f U 2 is given by 

p{U 2 n <x} = n- 3 / 2 V2T(a)x ia - 1)/2 - 

^"^((n^i e^(-^) J D_ aj (^))e^(g) J D_ a (v / ^sa;- 1 / 2 )v / z)ci S , 
at least for a > 1 . 

Remarks: The theorem should be true for all a > 0, but the additional 
condition implies the absolute convergence of the integral and facilitates the 
proof. The cdf of Greenwood's statistic is obtained by identical aj = 1 and 
a = n. For the relation of D^i to the error function see (13.21) . Splitting the 
integral into integrals over intervals between successive zeros of the integrand 
provides an alternating series which enables a good control of the remainder 
in many cases. E.g. by integration over the intervals (0; 2.978235860548) 
and (2.978235860548; 3.004569229995) we find the inequalities 

0.98999999977 < P{60£/ 6 2 < 2.7167772982} < 0.99000000060 . 

With increasing n a higher computing accuracy is required to compensate 
the extinction of the leading digits, in particular for the upper tail of this 
distribution. 

Proof. With (A.S. 19.3.1), (A.S.19.12.3) and (A.S. 13.5.1) we find for all po- 
sitive a that 

D_ a {iV~is) = 0( s max (~ a ' Q - 1 )) , s oo , (5.2) 

exp(— is 2 / A)D - a (\f^As) ~ exp(ia7r/4)s~ Q , s — Y oo , (5.3) 
(see also (A.S. 19.8.1)) . 

Hence, the above integrand is absolutely Ois o^i/a M where k = 

#{aj\ctj > 1/2}, and the integral is absolutely convergent if max <x,- > 
1/2 or a > 1/2. 
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Moreover, to justify a change of the order of integration in (I5.6P according 
to Fubini, 

J" (n™=i \D- aj (iVi8)\) \D^ a (x- 1/2 sV^t)\x^- 1)/2 e- tx dsdx = 
J (n;=i \D^.(iVis)\) iD^x-^sV^)^ x^-We-^dx + 

jo, sjoc \ D _ a ^ is ^^ \D- a (sV=i)\ds\ x a ' 2 e- tx dx < oo 

is verified by (15. 2 j) and (15 .3p . 
The Lt of 

— ^expf )D_ a (x 1/2 se 

with regard to x is given by 

exp (-se-^v 7 ^) (2t)~ (a+1)/2 , t > , < < vr/4 . (5.4) 

Due to dominated convergence this relation holds also for the limit 4> = 7r/4, 
i.e. 



has the Lt exp(isv / 2it)(2t)" (a+1)/2 . (5.5) 



Therefore, 

x (a-l)/2 



2tt 



has the Lt (5.6) 
^Il exp (~T) D -«i( iV * s )j exp(^ SV / 2^)(2t)- (Q+1)/2 v / zcis. 
By (A. S. 19. 12. 3) in conjunction with (A.S.13.1.5) we obtain 

(2t)-^ exp (^^) D_ a = O (|1 + < a |-«) , a oo , (5.7) 

13 



since 3^(1 + zs) 2 ) < if \s\ > 1. 

From the Lt in (13.1 p we get the Lt /y Z (s,t) of the joint density fy,z of 
Y = J2j=i Xj and Z = YTj=i , and find by the Laplace inversion formula 
with regard to s the Lt of fy,z with regard to Z as 



e 



i /* — - — |-ioo 

v / (ri e ' 2/4 ^(o)(2t)- (a - i)/2 e^rfc 

/ ( II e ~ S ' 2/4D "> NJ (2t)- (Q " 1)/2 e iS ^^ . 



Then the Lt of the conditional density of ZY 2 \Y = y is 

r(a) 



/OO " 
( n e" s2 / 4 D_ Qj (is)) (20- (Q - 1)/2 e is ^ S = 
oo i=1 

7^/°°^ ^(n e " s2/4 ^-^( is ))( 2 *)~ (Q_1)/2eia ^ 



2tt 



independent of Hence, the Lt of the cdf of U 2 is 

h{a) J™K ^(ne- s2 / 4 D_ Qj (* S ))(2t)-( Q+1 )/ 2 e^ ds = 

(5.8) 

h{a) J™K (^f[e- ls2 / 4 D_ aj (iVis)y2t)-^^ ds . 

Now, theorem 2 follows by comparison of ( 15. 8 p with (15. 6p . □ 

With a sufficient number of terms the partial sums of the integrand in 
the following theorem show a rather smooth behaviour within the "main 
part", also for larger n. The part of the integrand oscillating around zero 
becomes absolutely small and is more distant from zero. Its contribution 
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to the integral becomes neglectable. For simplicity we give here only the 
formula derived from identically distributed gamma random variables. The 
generalization to different order parameters ay is obvious. 

Theorem 3. Let X\, . . . , X n be i.i.d. random variables with a gamma density 
(Y(a))~ l e- X x a - 1 and U 2 = (£™ =1 X t )' 2 £™ =1 X?. Then the cdf of U 2 - n~ l 
is given by 



P{Ul-n- 1 < x) 

/oo Jv 
y 3 {(ip(s, tk)) n exp (— ians — i(n — l) _1 /c7r) } 

ds, (5.9) 



T(an + V 

n 2 (an/e) an k^L^_^ ; | 
1 — cos((l — n~ 1 )~ 1 k7rx) 



k 



I \ I 1(1 — IS) \ „ / /-I— IS 



V>(M) = [ttA exp r^—P- )DJVi 



2tJ r \ 8t J "V V2t 

- — — , n > max ( 3, 1 + — ) , < x < + 
a A n(n — 1) \ a J n 

The limit can be taken under the integral at least for 

rm / 1 3 \ 

y >max ^ 2 , 1+ __j. (5 . 10 ) 

In particular with a = 1 and t k = kn/ (n(n — 1)) we have at least for n > 6: 



ft! 



— oo 



7T 2 (n/e) 



fc=i 



i \'' 2 „ / /-l — is\ ( . ( {1 — is) 2 



1 — cos((l — n~ 1 )~ 1 kirx) 



k 



-ds. (5.11) 



Remark: The condition f)5.10p is too restrictive but enables the use of Par- 
seval's identity in the following proof. The absolute integrability of the cf 
fy,z is sufficient for the application of the Fourier inversion formula but not 
necessary. 
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Proof. From (13 .ip we obtain the characteristic functions 

and f Y ,z{s,t) = (i>{s,t)) n . 

With 3ft - is) 2 (2t)- 1 ) = sT 1 , t > and an > 1 the absolute convergence 
of 

/ {^{s,t)) n e- isy ds 

J — oo 

follows from (A.S.19.12.3) and (A. S. 13. 5.1) and the absolute convergence of 
(i)(s,t)) n e- isy ds 



is obtained by the asymptotic relation (A. S. 19. 8.1). 

Since the conditional distribution of ZY~ 2 \Y = y doesn't depend on y 
we can choose y = E(Y) = an and obtain the cf of U 2 — n^ 1 : 

/oo 
(ip(s, (an)~ 2 t)) n exp(—ians)ds . 
-oo 

Together with (15.11) this entails (15.91) . 

Now, a sufficient condition for the square integrability of fy,z is given. 
The square integrability of the densities fu and f\j2 is equivalent because of 

i r i 

2 j„ _ / fo..\-if* ^.^^ 2 



(f u i(x)) z dx= / i (2u)- 1 (f u (u)Ydu. 



n~2 



According to (14. 5p fu is square integrable if n > max (2, 1 + (2a) 1 ). From 
the identity of the distributions of U 2 and ZY~ 2 \Y = y it follows 

f i \ d p/7/ aiv \ 2fY,z(y,wy 2 ) 
fu*(w) = ~r p {Z <wy \Y = y\ =y — — 

dw g an {y) 

with g m (y) = (r(an))" 1 exp^y)^" 1 . 
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oo / py 2 



\Jn~ 1 y 2 
oo / py 2 

Wn-'i, 2 



With the additional assumption n > 3/(2a) we obtain 
(fY,z(y, z)f dz^j dy = 

{{g an {y)T l fr,z(y, z)) 2 dz^j {g an {y)f dy = 

y A J ^{(ganiy))' 1 fY,z(y,wy 2 )) 2 dw^j (y~ l g an (y)) 2 dy = 

» 1 POO 

Uu 2 (w)f dw- / (y~ l g an {y)) 2 dy < oo . 
i- 1 Jo 

Thus, due to Parseval's identity, the cf fy,z is square integrable and it follows 

00 / ' oc r ni ( _ 13 



(s, i)| n (is(it < oo at least for 



oo J — oo 



2 



>max(2,l + -,- 



^From (I4.5p it follows also that the density fy vanishes at its end-points for 
these n. 

Then the change of summation and integration in (15.91) is justified by 

V l^ S '* fc ) |n =o( r \iP(s,t)\ n dt], K^oo, and 
k>K k ^ Jt « J 

/OO / POO \ 
f / \ip{s,t)\ n dt) ds < OO . □ 
oo \Jtl J 
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6 Some ^-quantiles of nU^ — 1 

It should be noted that n is the number of squares. In the application for a 
goodness of fit test the sample size is k — n — 1. 
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